Doctoral Thesis Proposal Automatic Detection and Classification of Prosodic Events

نویسنده

  • Andrew Rosenberg
چکیده

Speech prosody is a valuable carrier of information. Accents and phrase boundaries have been shown to contribute to syntactic disambiguation, semantic, pragmatic and paralinguistic interpretation, and to convey information about topicality, focus, contrast and information status. This thesis will present and evaluate techniques to detect and classify these prosodic events. The acoustic correlates of accents, phrase boundaries and phrase-final tones will also be examined. Spoken language processing systems have not made widespread use of prosodic information. We hypothesize that access to this information should improve the performance of many SLP applications. To support this, we will present proof-of-concept examples integrating hypothesized prosodic event information into speech synthesis, story segmentation, extractive summarization, and prosody tutoring applications.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Detection and Classification of Prosodic Events

Automatic Detection and Classification of Prosodic Events Andrew Rosenberg Prosody, or intonation, is a critically important component of spoken communication. The automatic extraction of prosodic information is necessary for machines to process speech with human levels of proficiency. In this thesis we describe work on the automatic detection and classification of prosodic events – specificall...

متن کامل

Studies on Bird Vocalization Detection and Classification of Species

Aalto University, P.O. Box 11000, FI-00076 Aalto www.aalto.fi Author Seppo Fagerlund Name of the doctoral dissertation Studies on Bird Vocalization Detection and Classification of Species Publisher School of Electrical Engineering Unit Department of Signal Processing and Acoustics Series Aalto University publication series DOCTORAL DISSERTATIONS 166/2014 Manuscript submitted 12 June 2014 Date o...

متن کامل

Automatic punctuation and disfluency detection in multi-party meetings using prosodic and lexical cues

We investigate automatic approaches to finding “hidden” spontaneous speech events, such as sentence boundaries and disfluencies, in multi-party meetings. Hidden events are characterized prosodically by a large array of automatically extracted energy, duration, and pitch features, and are modeled by decision tree classifiers; lexical cues are modeled by N-gram language models. Both sources of in...

متن کامل

Automatic road crack detection and classification using image processing techniques, machine learning and integrated models in urban areas: A novel image binarization technique

The quality of the road pavement has always been one of the major concerns for governments around the world. Cracks in the asphalt are one of the most common road tensions that generally threaten the safety of roads and highways. In recent years, automated inspection methods such as image and video processing have been considered due to the high cost and error of manual metho...

متن کامل

Automatic Punctuation and Disfluency Meetings Using Prosodic An

We investigate automatic approaches to finding “hidden” spontaneous speech events, such as sentence boundaries and disfluencies, in multi-party meetings. Hidden events are characterized prosodically by a large array of automatically extracted energy, duration, and pitch features, and are modeled by decision tree classifiers; lexical cues are modeled by N-gram language models. Both sources of in...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007